
HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models

Neural Information Processing Systems

Generative models often use human evaluations to measure the perceived quality of their outputs. Automated metrics are noisy, indirect proxies, because they rely on heuristics or pretrained embeddings. However, up until now, direct human evaluation strategies have been ad hoc, neither standardized nor validated. Our work establishes a gold standard human benchmark for generative realism. We construct Human eYe Perceptual Evaluation (HYPE), a human benchmark that is (1) grounded in psychophysics research in perception, (2) reliable across different sets of randomly sampled outputs from a model, (3) able to produce separable model performances, and (4) efficient in cost and time. We introduce two variants: one that measures visual perception under adaptive time constraints to determine the threshold at which a model's outputs appear real (e.g.


Reviews: HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models

Neural Information Processing Systems

This paper introduces a framework to evaluate the perceptual realism of samples from generative models. The framework, HYPE (Human eYe Perceptual Evaluation), is based on psychophysics methods. Two different metrics are proposed. The first one, HYPE_time, measures the exposure time a human needs before distinguishing a real image from a fake one. The metric is clearly defined and well founded on psychophysics.
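The adaptive time-constraint procedure behind HYPE_time resembles a classic psychophysics staircase, which raises or lowers stimulus exposure based on the rater's answers. As a rough illustration only (a hypothetical sketch, not the authors' implementation), a simple 1-up/1-down staircase over exposure times might look like this:

```python
import random

def staircase_threshold(judge, exposures_ms, trials=50):
    """Simple 1-up/1-down adaptive staircase over exposure times.

    `judge(exposure_ms, is_real)` stands in for a human rater and
    returns True when the rater classifies the image correctly at
    that exposure. Correct answers shorten the next exposure (making
    the task harder); errors lengthen it (making it easier). The
    exposure the staircase settles on approximates the rater's
    perceptual threshold.
    """
    idx = len(exposures_ms) // 2          # start mid-range
    for _ in range(trials):
        is_real = random.random() < 0.5   # real and fake shown equally often
        correct = judge(exposures_ms[idx], is_real)
        if correct:
            idx = max(0, idx - 1)                          # shorter exposure
        else:
            idx = min(len(exposures_ms) - 1, idx + 1)      # longer exposure
    return exposures_ms[idx]
```

A 1-up/1-down rule converges near the 50% correct point; psychophysics experiments often use variants such as 2-down/1-up to target higher accuracy levels, and the paper's actual adaptive schedule may differ.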


Reviews: HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models

Neural Information Processing Systems

The reviewers were unanimous in judging that this is good quality work that tackles an important and relevant problem for NeurIPS, and that it will attract the attention of a wide audience. The rebuttal solidified this viewpoint in the discussions thereafter. Given the enthusiastic reviews, I think this deserves an oral presentation at NeurIPS.


HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models

Zhou, Sharon, Gordon, Mitchell, Krishna, Ranjay, Narcomey, Austin, Fei-Fei, Li F., Bernstein, Michael

Neural Information Processing Systems
